arXiv:1509.05676v2 [math.FA] 19 May 2016 


PRE-IMAGES OF EXTREME POINTS OF THE 
NUMERICAL RANGE, AND APPLICATIONS 

ILYA M. SPITKOVSKY, STEPHAN WEIS 

Abstract. We extend the pre-image representation of exposed 
points of the numerical range of a matrix to all extreme points. 
With that we characterize extreme points which are multiply gen¬ 
erated, having at least two linearly independent pre-images, as the 
extreme points which are Hausdorff limits of flat boundary portions 
on numerical ranges of a sequence converging to the given matrix. 
These studies address the inverse numerical range map and the 
maximum-entropy inference map which are continuous functions 
on the numerical range except possibly at certain multiply gener¬ 
ated extreme points. This work also allows us to describe closures 
of subsets of 3-by-3 matrices having the same shape of the numer¬ 
ical range. 


1. Introduction 

We denote the set of complex d x d matrices by d E N, with 
identity matrix 1. The numerical range of A G is the subset 

W(A) := {fAix) I a; G 

of the complex plane C where —?■ C, a: e-)■ x*Ax is the numer¬ 

ical range map dehned on the unit sphere S'C'^ = {a: G | x*x = 1} 
of C'^. On C = we use the standard Euclidean scalar product 
(a,/3) = ^{afd) for a,/3 G C. The numerical range is a compact and 
convex subset, the convexity statement being known as the Toeplitz- 
Hausdorff theorem [3I1II5], more recent work includes pen Eg. 

We are interested in points a E W (A) which are 

• extreme points, that is a cannot be written as a proper convex 
combination of points in W{A), 

• multiply generated [251 ETl that is fA^{a) contains at least two 
linearly independent vectors. 

Part of our interest in these points comes from a continuity problem 
in operator theory. The (multi-valued) inverse : W (A) —)■ SC‘^ of 
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fA is strongly continuous at a G (A) if for all x G the function 

Ja is open at x. Strong continuity holds on W (A) except at certain mul¬ 
tiply generated extreme points. A round boundary point (see [7l [23l |24] 
and Lemma 6.1 of [29]) is an extreme point of W{A) which lies on a 
unique supporting line {a G C | («,e'^) = /i(e'®)} where 0 G M, 
is an outward pointing normal vector and /i,(e'®) = maxQg^(^)(a, e‘®). 
The map is strongly continuous on W (A) except possibly at multi¬ 
ply generated round boundary points [7| but may be strongly continu¬ 
ous also there. A characterization of strong continuity of in terms 
of analytic eigenvalue curves [23] shows that /A^ has at most hnitely 
many discontinuity points. The corresponding eigenvector curves will 
be discussed in Coro. 12.51 where we consider their intersection with 
pre-images of extreme points under Ja- 

Further interest in multiply generated extreme points comes from an 
optimization problem in quantum mechanics. The maximum-entropy 
inference, going back to ideas by Boltzmann, is a method to select 
a quantum state from the expected values of a collection of physical 
quantities represented by hermitian matrices when all other informa¬ 
tion about the state is ignored [18]. The numerical range W{A) is the 
set of expected valnes of two hermitian matrices given implicitly by the 
real part 3?(A) = ]^[A-\- A*) and imaginary part A(A) = A(^A — A*) of 
A [21 [29] . The inference map is 

p* : W{A) — )■ M-di o argmax{S'(p) | p G tr(Ap) = a} 

with state space Aid consisting of all positive semi-definite matrices 
of trace one and von Neumann entropy S{p) = — trplog(p). A dis¬ 
continuity of p* may occur EH- Note, however, that for normal A its 
numerical range W{A) is a polytope, 3fJ(A) and 9(A) commute, and p* 
is continuous |35) . So, a discontinuity belongs to the proper quantum 
domain where it is discussed in the context of quantum phases ^m\. 

As it happens, the described continuity problems are equivalent. The 
map p* is continuous at a G IF (A) if and only if /A^ is strongly contin¬ 
uous at a [36], and p* is indeed discontinuous at all isolated multiply 
generated round boundary points of IF (A), see Sec. 6 of [21] • Further, 
it is known that multiply generated round boundary points are isolated 
for d = 3. Calling A G unitarily reducible if A is unitarily similar to 
a block diagonal matrix with two proper blocks and unitarily irreducible 
otherwise, multiply generated ronnd boundary points are isolated for 
all irreducible matrices of size d < 5 |23| . But, the whole boundary 
of IF (A) may consist of multiply generated round boundary points for 
redncible 4-by-4 matrices and irredncible 6-by-6 matrices |23| . 
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Here we study these continuity problems from a topological perspec¬ 
tive of a variable matrix A G M^. To this end we define a flat boundary 
portion of W (H) as a maximal proper segment in the boundary of W (H) 
[HIH[281 [9]. If a G hT(H), then we say that a flat boundary portion is 
born at a if there exists a sequence C converging to A such 

that a sequence (sj)i6N of flat boundary portions converges to {a} in 
the Hausdorff distance, the segment Sj being a flat boundary portion 
of W{Ai), i G N. 

The birth of a flat boundary portion was conjectured in Sec. LB of 
[57] to be a condition for a discontinuity of p*. The above discussion and 
the following theorem prove that the birth of a flat boundary portion at 
a round boundary point is a necessary condition. It is not a sufficient 
condition for d > 4 (d > 6 if H is unitarily irreducible). 

Theorem 1.1. Let a be an extreme point ofW{A). For aflat boundary 
portion to be born at a it is necessary and sufficient that a is multiply 
generated. 


The necessity in Thm. 11.11 follows from properties of the Hausdorff 
distance (Sec. [3|). We prove the sufficiency in Sec. [2] using a newly 
developed representation of extreme points in terms of pre-images. 
Pre-images were well-understood |3T] for exposed points which can be 
represented as the intersection of W{A) with a supporting line. The 
number h(e^®) defined above is the maximal eigenvalue of the hermitian 
matrix cos(d)3?(H) -|- sin(d)Q'(H), the corresponding eigenspace is the 
demanded pre-image. 

Some extreme points may fail to be exposed. Consider, e.g., the 
convex hull of a circle and a point outside the circle, which is realized 


as the numerical range of A = 


0 2 0 
0 0 0 
0 0 2 


Then the intersections of the 


two tangents from the point to the circle with the circle are extreme 
but non-exposed points. We will obtain the pre-images of non-exposed 
points by viewing them as exposed points of some flat boundary por¬ 
tion whose supporting lines are given by directional derivatives of h [3]. 
Notice that viewing non-exposed points as exposed points of a convex 
subset is a familiar idea in convex geometry [T51I55] (cf. poonem), geom¬ 
etry of quantum states [52], and in the theory of exponential families 
[SllM] (cf. access sequence). 

Finally, in Sec. ID we combine observations from [TTl l22ll2Tll28ll251l26] 
with Thm. 11.11 to compute closures of subsets of 3-by-3 matrices hav¬ 
ing the same shape of the numerical range. In particular, we prove in 
Sec. [5] that a 3-by-3 matrix A with W (H) having a non-empty inte¬ 
rior lies in the closure of irreducible matrices with elliptical numerical 
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range and in the closure of irreducible matrices with flat portion on the 
boundary of their numerical range if and only if W (A) is an ellipse with 
an eigenvalue of A on the boundary. Remarkably, these are precisely 
the 3-by-3 matrices A where and p* have a discontinuity [71 1231 [29] , 

Acknowledgements. SW cherishes the memories of discussions with Leiba Rodman 
during the workshop LAW’14 in Ljubljana, Slovenia, June 4~12, 2014. He also ap¬ 
preciates financial support by a Brazilian Capes scholarship. IS acknowledges the 
support by the Plumeri Award for Faculty Excellence from the College of William 
and Mary and by Faculty Research funding from the Division of Science and Math¬ 
ematics, New York University Abu Dhabi (NYUAD). Both authors are especially 
thankful to the latter for hosting SW’s research visit to NYUAD in December’14. 


2. Representation of extreme points 

We provide a pre-image representation of extreme points of the nu¬ 
merical range. We rewrite it in terms of eigenvalue curves and we use 
it to prove one part of Thm. 11.11 

Recall that the relative interior of a subset C* of a Euclidean space 
(E, (•, •)) is the interior of C in the affine hull of C. A face of a convex 
set C is a convex subset F G C which contains every closed segment in 
C the relative interior of which intersects F. The element of a singleton 
face is called extreme point. The support function of a convex body C, 
that is a compact convex subset of E, is 

hc(u) := ma.x{x,u), u G E. 
xec 

The supporting hyperplane of C with outward pointing normal vector 
M G E \ {0} is the set of a: G E on the hyperplane {x, u) = hc{u). The 
intersection of this hyperplane with C is the exposed face 

argmax(a:, u) 

x£C 

with outward pointing normal vector u. If {«} is an exposed face, then 
a is called an exposed point. All exposed points are extreme points. 
The remaining extreme points are called non-exposed points. 

Notice that a supporting hyperplane of W{A) is a supporting line. 
Since W{A) has (real) dimension at most two, a subset is a one¬ 
dimensional face if and only if it is a flat boundary portion, and every 
flat boundary portion is an exposed face. Moreover, the boundary of 
W (A) is the disjoint union of extreme points and relative interiors of 
flat boundary portions. So, every extreme point a is an exposed point, 
an endpoint of a flat boundary portion, or both. It is crucial in the 
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sequel that every non-exposed point of W{A) is an exposed point of 
some flat boundary portion of W{A). 

Let 

A{e) := ^{e-^^A) = cos(0)3fJ(A) + sin(0)$>(A), 6 eR. 

We denote by Xjj^{ 6 ) the eigenspace of A{ 6 ) corresponding to the max¬ 
imal eigenvalue Am(^) of A{ 6 ). An easy computation gives 

(2.1) {fA{x),e^^) = ^{x*Axe~^^) = fA{ 0 ){x), x G S'C'^. 

The maximal eigenvalue Am(6*) of A{ 6 ) has the geometric meaning of 
support function of the numerical range, see Sec. 4 of |31) . 

(2.2) hw(A){e}^) = Am(^)- 

Let Fa{0) denote the exposed face of W{A) with outward pointing 
normal vector 6 G M. 

Lemma 2.1. The point /a^x), x G SC^, lies in the exposed face Fa{9), 
9 eR, if and only if x E Xm(0). 

Proof: We consider the orthogonal direct sum C'^ = X^{9)(BX„i{9)^. 
A unit vector x E SC^ has a unique decomposition x = y + z for 
y E X^{9) and z G X^{9)^. So 

(/^(x), = \^{9) + z*{A{9) - \r.{9)t)z 

has the maximal value Am(6') only for z = 0. This proves the claim. □ 

Non-exposed points of the numerical range are not addressed in 
Lemma Ml To describe them in terms of pre-images, we view them 
as exposed points of a flat boundary portion. The two endpoints p±{9) 
(not necessarily distinct) of the exposed face Fa{9) are characterized 
by their membership in Fa{9) and by the equality 

{p±{9),±iF^) = hF^(0)(±ie'®). 

To evaluate the support function of Fa{9) we consider the directional 
derivative of a function / : M™' —)■ M” at a point u G M™' in direction 
V E M™, which is dehned by 

f\u; v) := ^lim \{f{u + tv) - /(«)), 

if the limit exists. Given a convex body C* in a Euclidean space E 
and M G E \ {0}, the support function he of the exposed face G of C 
with outward pointing normal vector u is heiv) = h'(j{u-,v), n G E, see 
Section 16 of 

Since the chain rule does not hold for directional derivatives, we need 
to go into some detail in the following proof. 
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Lemma 2.2. The support function of the exposed face Fa{0), 6^ G M, 
has the values /ii?^(e)(± ie*®) = A'j^(6*;±l). The endpoints of Fa{ 0) are 

p±(«) = e“'(A„(«)±iA;„(»i±l)), 

Proof: The equation for support functions of exposed faces, recalled 
in the previous paragraph, proves 

hir^(e)(±ie‘®) = ± i e‘®). 

For all s,t > 0 with s = arctan(t), and writing h = hw{A), we have 

± t i - h{F^)) = - h{F%) + g{s) 

where ^'(s) —?■ 0 for s —)■ 0, because h is positive homogeneous of the 
hrst degree [3]. Taking the limit f —)■ O’*" gives 

±ie‘®) = {hw{A) ° ^^{6; ±1). 

The values of follow from fl2.2p which provides h]y(^A) o = 

Xm{d)- The formula for the endpoints is obvious. □ 

We are ready to describe p±{0) in terms of its pre-image under fA- 
Noticing A'{6) = where denotes derivative with respect 

to 6, an easy calculation shows that for all y G 5'C'^ 

(2.3) {fA{y)pF^) = fA'{e){y) 

holds. Equations fl2.ip and fl2.3p show that for all y G S'C'^ we have 

(2.4) fA{y) = F\fA{e){y) + ifA'{e){y))- 

We denote by the compression of i? G Md onto a subspace X C C'^, 
that is B\x is the restriction of PBP to X where P is the orthogonal 
projection onto X. For 6* G M dehne : C —)■ M, a i-G- {a,ie^^) and 
he : R -)■ C,77 i-G- e‘®(Am(6') ir^). 

Theorem 2.3. For all 6 & M. the maximal (respectively, minimal) 
eigenvalue of A'{9)\x,n{d) -^m(^i 1) (respectively, For 

all unit vectors x G X^{9) we have fA'{e)\x,„(e)^^) — 9e ° Ia^x). The 
map (701 Fa (0) ^ Pa{9) -A- s is a bijection to the segment s C R with 
endpoints ±A'j^(6';±l), the inverse is he\s- 

Proof: The pre-image of the exposed face Fa{9) is by Lemma 12.11 
equal to S'C'^ Fl Xni(h). By dehnition of the support function of Fa{9) 
we get for all x G S'C'^ fl Wm(h) the inequalities 

-hFA{e){-iF^) < {fA{x),iF^) < hip^(0)(ie‘®). 
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Both equalities are attained because Fa{0) is compact. Using Lemma[22] 
and fl2.3p . the above inequality is equivalent to 

which shows that the hermitian operator A'{9)\x^ has minimal eigen¬ 
value —— 1) and maximal eigenvalue A'nj(6'; 

The numerical range of the hermitian operator A^{6)\x^{e) is the 
segment s between its extreme eigenvalues ±A'j^(6'; ±1). By fl2.2p and 
fl2.4p we have for all x G S'C'^ fl Xm(6*) 

fA{x) = F\Xm{0)+ ifA'{e)ix)). 

Since fA'{e){.x) = (/A(x),ie‘^) holds, again by (12.3p . the functions go 
and hg have the claimed properties. □ 

We can now compute the pre-images of all extreme points. Thm. 12.31 
proves that ±A^(6'; ±1) is an eigenvalue of A'{9)\x^- We denote the 
corresponding eigenspace by X±{6). 

Corollary 2.4. The point Ja^x), x G is the endpoint p±{6), 

0 G M, o/ the exposed face Fa{ 0) if and only if x & X±{6). 

Proof: This follows from Thm. 12.31 and from Lemma 12.11 applied to 
the exposed points ±A^(0; ±1) of the numerical range of the compres¬ 
sion Al'(0)|x„,(0). □ 

We describe the eigenspace X±{9) of A'{9)\x^ in terms of eigen¬ 
value curves {Xk{9)}f.^i of A{9) and mutually orthogonal eigenvectors 
{xk{(^)}k=i which depend real analytically on the parameter 9 |27] . 
Thus A{9) = ^k{G)xk{(^)xk{G)* and we have for /c = 1,..., d 

Xk{9) = fA{e){xk{9)), d G M. 

We recall that the derivative of Xk with respect to 9 (see Lemma 3.2 of 
[T9] and Sec. 5 of [11]) is 

(2.5) Xii9) = fAgo){xk{9)), 9eR. 

We define 

U0) :={te{l,...,d}\Xi{9) = >^m{0)} 

and 

I±{e) := {7 G UO) I m = ±A',,(d;±l)}. 

Corollary 2.5. The span of {xk{9) \ k G /±(^)} is X±{9), 6* G M. 
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Proof: By Thm. 12.31 the eigenvalue ±A'j^(0; ±1) of the hermitian op¬ 
erator A'{6)\x^{e) is extreme. Hence, fl2.5p shows that X±{6) contains 
all eigenvectors Xk{9) of A{9) with k G I±{9). For all k G Ira{9) \ I±i9) 
we have by (12.Sp 

x,{9rA'{9)x,{9) = X[{9) ^ ±XM±1). 

Therefore X±{9) is the span of {xk{9) \ k E I±{9)}. □ 

We hnish the section with an application. See the next section for 
the dehnition of the Hausdorff distance. 

Proposition 2.6. Let a be a multiply generated extreme point of the 
numerical range W{A). Then a flat boundary portion is born at a. 

Proof: Without loss of generality let a = Pa{9) for some 0 G M and 
some a G {+,—}• Since X„{9) belongs to the eigenspace Wm(6*) of A{9) 
corresponding to the maximal eigenvalue Am(6') of A{9), the equation 
(12.4p implies that for all x G SC^ fl X„{9) 

fAix) = e'%X^{9) + ifA'(g)ix)). 

Let P denote the orthogonal projection onto X^{9). Coro. 12.41 shows 
that X^{9) is the pre-image of a under fA- Hence, there exists A G M 
such that A'{9)\x„(e) = XP\x„{e)-i for otherwise fA^X^jif))) could not be 
a singleton. As the numerical range of XP\x„{e) is {A}, Thm. l2.3l proves 
a = e‘®(Am(6') + i A). 

By assumption, a is multiply generated, so dimc(Xo-(6*)) > 2 holds. 
Choose any hermitian matrix H with H\x^{e) not being a scalar multi¬ 
ple of the identity and denote its maximal eigenvalue by /i_|_ and mini¬ 
mal eigenvalue by /i_. Let e > 0, and dehne 

A, := P^(A(9) + eP + i{A'{9) + eH)). 

The numerical range of {A'(9) -|- eH)\x^{d) is the segment between the 
two distinct reals A -I- ep±. Since Xa-{9) is the eigenspace of 
corresponding to the maximal eigenvalue Am(6') -|- e of the 

exposed face Fw(aP){9) of W{Afj has by Thm. 12.31 the endpoints 

e'^(Am(6*) + e + i(A -|- ep±)). 


Clearly, Fw{a^){9) is a flat boundary portion of W{Afj which converges 
for e ^ 0 in the Hausdorff distance to {a} while A^ converges to A. 
This completes the proof. □ 
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3. Hausdorff distance 

We address the Hausdorff convergence of numerical ranges and show 
that a flat boundary portion can only be born at a point of the numer¬ 
ical range which is multiply generated. 

We denote by | • | the Euclidean norm in a Euclidean space (E, (•, •)). 
The set of non-empty compact subsets of E is a complete metric space 
with respect to the Hausdorff distance 

dniK, L) := max(maxmin \y — x\, max min \y — x|), 

x^K y£L y^L x£K 

where 0 7^ it', L C E are compact. The set of all convex bodies in E is 
closed in the metric space of compacts by Thm. 1.8.5 in |30| . 

We first recall a condition for the Hausdorff convergence. 

Remark 3.1 (Thm. 1.8.7 in [30]). A sequence (it'j)jgi^ of convex bodies 
in E converges to a convex body 77 in E if and only if the following two 
conditions hold. 

(1) Each point in K is the limit of a sequence with Xi G Ki for 

i e N. 

(2) The limit of any convergent subsequence with Xi^ G 77q for 

j G N belongs to K. 

The simplest example of a non-converging sequence illustrates that 
subsequences are needed in (2). Consider for example K = {0} and 
the sequence dehned by K 2 i = K and K 2 i+i = [0,1], i G N. 

Remark 3.2 (Hausdorff convergence of the numerical range). 

(1) Remark 3.1 and the continuity for every x G of the linear func¬ 
tion Md ^ C, A i-A x*Ax in A prove that 

^ W{Ai)^^W{A), (4).6NCMrf. 

Here, the hrst limit is in any norm, the second limit is in the Haus¬ 
dorff distance. 

(2) The support function (12.21) of W{A) in the direction is jointly 
continuous in A G Md and 6* G M because the maximal eigenvalue of 
a hermitian matrix is a continuous function of the matrix. There¬ 
fore, if 6^ = limj^oo^'i, (^'OieN C M, A = limj^ooAj, (Ai)*^^ C M^, 
and if the exposed faces i74.(6*i) converge in the Hausdorff distance, 
then their limit is a subset of Fa{6 ). 

We now complete the proof of Thm. 11.11 Recall from Section |2l third 
paragraph, that a flat boundary portion of W (A) is an exposed face of 
dimension one. 





10 


Pre-images of extreme points 


Lemma 3.3. If a flat boundary portion is born at a & W{A), then a 
is multiply generated. 

Proof: Let (sj)jgN be a sequence of segments, each Sj being a flat 
boundary portion of W{Ai), z G N, and assume that A = linij^oo^t 

and that Si {a} converges in the Hausdorff distance. The segments 
Si are exposed faces of W{Ai). So, for every z G N there exists 6^4 G M 
such that e'®* is an outward pointing normal vector of Sj. Lemma [ 2.11 
implies 

fA^{s,) = SC^nH, 

for some subspace Hi C C'^. The segments Sj are no singletons so 
dimc(Lfj) > 2. This proves that f^^{si) contains two orthogonal vec¬ 
tors Pi, Qi. The compactness of SC^ proves pi. p and q 

for suitable p,q & SC^ and a suitable subsequence. Note that p and q 
are orthogonal. Since /a(p) is jointly continuous in A and p, and since 
fAiiPi) £ Si for all z G N, this shows 

|/a(p) -a\= lim \ fAi.{Pii) - a| < limsupmax \fl - a\ 

J^oo 0 g^si. 

< limsupcii^(sj^., {a}) = 0. 

j->oo 

Similarly, /^(g) = ct holds and hence a is multiply generated. □ 

Before turning to other subjects we add a statement about the Haus¬ 
dorff convergence of ellipses in the plane. Thereby an ellipse is the zero 
set of a real quadratic form g : —)■ M, x 1 —)■ x*Sx + b*x -|- c where 
S G M 2 is a real symmetric and positive dehnite 2-by-2 matrix, & G R^, 
and c G R. We call the level set {x G R^ | /(x) < 0}, which is the 
convex hull of the ellipse, also an ellipse. 

Remark 3.4. If a sequence of ellipses in R^ converges in the Hausdorff 
distance then the limit is an ellipse, a segment, or a point. This can be 
proved by representing each ellipse as a linear image of the unit disk 
and by using Rem. 13.11 and compactness arguments. 

4. 3-BY-3 MATRICES 

Kippenhahn’s [22] representation of the numerical range oi a d x d 
matrix A in terms of the convex hull of a planar real affine algebraic 
curve provides a classihcation of the possible shapes of the numerical 
range of a 3-by-3 matrix whose equivalence classes are well-understood 
[ 2 TI |28] in terms of matrix entries and of spectral data of A. We 
compute the closures of these equivalence classes. 
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Using the hermitian real and imaginary parts oi A = 3?(y4) + iQ'(y4), 
the homogeneous polynomial 

F{yo, yi, 1 / 2 ) := det(i/ol + i/i3f?(A) + i/ 2 S(A)) 

dehnes a curve Sa ■= {{yo ■ yi ■ 1/2) e PC^ | F(1/0,1/1,1/2) = 0} in 
the complex projective plane PC^. The dual curve 5*^4 is an algebraic 
curve in PC^ which consists roughly speaking of the tangent lines to Sa 
[121 [in]. Thereby, a line {(1/0 : yi : 1/2) G PC^ | Xoyo+Xiyi+X 2 y 2 = 0} is 
identihed with the point (xq : Xi : X 2 ) G PC^. The boundary generating 
curve *S'^(M) of A is the real part of the affine component xq = 1 of 
One can show that the convex hull of «S'j 4 (R) is the numerical range 
[a [ 22 ]. See also |16) for further algebraic context of the numerical 
range. 

We will discuss boundary generating curves separately for unitarily 
reducible and irreducible 3-by-3 matrices. We will say that a G C 
is a normal eigenvalue of A is there exists a non-zero vector x such 
that Ax = ax, A*x = ax. The 3-by-3 matrix A has a normal eigen¬ 
value if and only if A is unitarily reducible. Every normal eigenvalue 
contributes as a point to the boundary generating curve, see §7 of |22j . 

Remark 4.1. A complete list of boundary generating curves for a 
unitarily reducible matrix A G M 3 is 

( 1 ) three normal eigenvalues (not necessarily distinct), 

( 2 ) one normal eigenvalue and one ellipse. 

The corresponding shapes of W{A) are (1) a point, a segment, or a 
triangle and ( 2 ) an ellipse or the convex hull of an ellipse and a point 
outside the ellipse. 

Remark 4.2. A complete list of boundary generating curves for a 
unitarily irreducible matrix A G M 3 is 

( 1 ) an ellipse and a point inside the ellipse, 

( 2 ) a degree four curve with a double tangent, 

(3) a degree six curve consisting of two nested parts one inside another, 
the outer part having an ‘ovular’ shape. 

Following [25] we denote the sets of irreducible matrices (1) by T 3 , 
their numerical ranges being ellipses, and the set of irreducible matri¬ 
ces ( 2 ) by F 3 , their numerical ranges having a flat boundary portion. 
Further, we denote the set of irreducible matrices (3) by 03 and the 
set of reducible 3-by-3 matrices by TZ^. This gives a disjoint union 

(4.1) M 3 = 77.3 U £^3 U F 3 U 03. 
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We begin with some observations following from already published 
results though not necessarily explicitly stated there. 

Remark 4.3. 

(1) The set of reducible matrices TZ^ is closed and nowhere dense in M 3 . 
In fact, the analogous statement holds for all matrix sizes d G M, 
see Problem 8 in na. 

(2) The set S 3 is closed relative to the set of unitarily irreducible matri¬ 
ces, and nowhere dense. The closedness follows from Thms. 2.3, 2.4 
of | 2 T] which provide a constructive criterion for a 3-by-3 matrix to 
have an elliptical numerical range. Indeed, these conditions are in 
form of equalities which persist under taking limits, while staying 
within the set of non-normal matrices. This implies further that 
the set of 3-by-3 matrices who have an elliptical numerical range is 
closed relative to the set of non-normal matrices. The fact that S 3 
is nowhere dense was derived from the same criterion in | 2 S], see 
Prop. 3.1 there. 

(3) The set J ^3 is closed relative to the set of unitarily irreducible ma¬ 

trices and nowhere dense. The closedness follows from Prop. 3.2 of 
| 21 ) . according to which a unitarily irreducible matrix A belongs to 
J -3 if and only if M 3 fJ( 74 ) + -|- wl has rank one for some real 

u,v,w. It remains to invoke the lower semi-continuity of the rank 
function. The fact that J ^3 is nowhere dense can be seen from an 
alternative description of this class, provided by [251 Theorem 1.2]. 

(4) From (14.ip and already established (l)-(3) it directly follows that 
the set O 3 is open and dense in M 3 . 

The statements (2) and (3) can be rehned using Kippenhahn’s clas- 
sihcation in Remarks 14.11 and 14.21 and the property that the numerical 
ranges of a converging sequence of matrices converge to the numerical 
range of the limit matrix (Rem. 13.2p . 

Lemma 4.4. 

(1) If A E M 3 \ S 3 is the limit of a sequence in S 3 , then A is reducible 
and W (R) is an ellipse, a segment, or a point. 

( 2 ) If A E M 3 \ is the limit of a sequence in IF 3 , then A is reducible. 
If W (R) is an ellipse then the normal eigenvalue of A lies on the 
boundary of W (A). 

Proof: (1). Rem. I4.3f 2l shows that A is reducible and that W{A) 
is an ellipse if A is not normal. The numerical range W{A) cannot 
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be a triangle because the limit of a sequence of ellipses is an ellipse, a 
segment, or a point (Rem. 13.4p . 

(2). Rem. 14.31 31 shows that A is reducible. The Blaschke selection 
theorem, Thm. 18.6 of [30], shows that there is a sequence (y4j)jgp} C -F 3 
converging to A with flat boundary portions Sj of W{Ai), i G N, such 
that (sj)jgN converges in the Hausdorff distance. If W{A) is an ellipse, 
then Rem. 13.21 21 shows that the limit is an extreme point a oiW (A) 
and Lemma 13.31 proves that a is multiply generated. Now Thm. 3.2 
of [23] proves that a is an eigenvalue of A which is in fact a normal 
eigenvalue, see Theorem 1.6.6 of [17| . □ 


5. Reducible 3-by-3 matrices 

In this section we describe the intersection of the (norm) closures T 3 
of T 3 and J ^3 of with the set of reducible matrices. 

To simplify possible limit points of sequences we consider the equiv¬ 
alence of matrices A G Md modulo 

• unitary similarity, 

(5-1) • substitution of R by a matrix in Mrf while preserving 

the linear span of 1 ,3fJ(A), 9'(A). 


The numerical range W{A) and the boundary generating curve 5'^(M) 
are invariant under the action of the unitary group. The substitutions 
are realized by the group of invertible affine transformations of the 
real plane, whose action commutes with the operators A\-^ W (A) and 
A i-A S'^(M), see §2.4 and §4.18 of [22]. Hence, every equivalence class 
fIS.ip is a subset of an equivalence class in Kippenhahn’s classification 
provided in Remarks 14.11 and 14.21 In particular, the blocks TZ^, S 3 , 
J^ 3 , and O 3 in the partition fl4.ip of Md are invariants of flS.ip . Since 
the action of a fixed unitary (respectively, affine transformation) is a 
homeomorphism of Md, the closures S 3 and J ^3 are invariants, too. 


Lemma 5.1. Any matrix A G M 3 such that 3fJ(H) and 9'(R) commute 
is equivalent modulo (15.ip to the diagonal matrix 

(5.2) 0, diag[0. A, 1] for 0 < A < |, or diag[0,1, i]. 


Any reducible matrix A G M 3 such thatiR.{A) and 9'(A) do not commute 
is equivalent modulo (15.ip to 


(5.3) 


0 2 0 
0 0 0 
0 0a 


a > 0 . 


No two of the matrices in (O or (15.3p are equivalent modulo (I5.ip . 
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Pre-images of extreme points 


Proof: We shall simplify A using transformations (15. Ij) . Since A is 
unitarily reducible we can assume that its real part is X © a and its 
imaginary part Y (B b where X, Y are self-adjoint 2-by-2 matrices and 
a, b reals. Using an affine transformation we can assume that tr(X) = 0, 
tr(y) = 0, a > 0, and 6 = 0. Thereby, the real and imaginary parts of 
the original matrix A commute if and only if X and Y commute. 

If X and Y commute then, using unitary similarity, we assume that 
X and Y are scalar multiples of diag[l, —1]. The three cases A = 0, 
(A 0, Y = 0), and (A ^ 0, Y 7 ^ 0) lead via affine transformations to 
the cases of fl5.2p in the same order. 

If X and Y do not commute then by adding scalar multiples of 
U © 0 to X © a we assume that X and Y are orthogonal with respect 
to the Hilbert-Schmidt inner product M 2 x M 2 —)■ C, {x,y) 1 —)■ tT{x*y). 
Normalizing and applying an SU{2) = SO{3) rotation we obtain a 
matrix of the form fl5.3p . 

To see that no two of the matrices described in fl5.2p and fl5.3p are 
equivalent modulo fl5.ip recall that the boundary generating curves of 
two equivalent matrices are affinely isomorphic. The boundary gener¬ 
ating curves of the described matrices are one, two, or three points on 
a line, the vertices of a triangle, or the union of a circle and a point. 
This makes sure that no two of them are affinely isomorphic. □ 


We shall compute the intersections of S 3 with the set of reducible 
matrices. It is shown in [28], Sec. 2 , that the boundary generating 
curve of the matrix 


A' 


a X y 
0 b z 
0 0c 


a, 6, c,x,y, z & C 


consists of an ellipse and a point if and only if d := \x\^ + ||/P + \z\‘^ > 0 
and the number 


(5.4) A := (c|xp + b\y\^ + a\z\‘^ — xyz)/d 

coincides with one of the eigenvalues a, b, c of A'. In this case the two 
other eigenvalues of A' are the foci of an ellipse with minor axis of 
length \fd, and the boundary generating curve of A' is the union of 
this ellipse and of A. Moreover, the eigenvalue A is a normal eigenvalue 
if and only if A is reducible. 

Observe that the boundary generating curve of the following matrix 
is the union of the unit circle and a point lying on or inside the unit 
circle. So the numerical range is the unit disk. 


020 

000 

0 0 a 


lies in S 3 . 


Lemma 5.2. If a E [0,1] then the matrix 
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Proof: If a > 0 then the matrix in the above statement equals a-A{^) 
where 

^( 7 ) := 000 , 7 > 2 . 

Lo 0 1J 

Hence, it suffices to prove ^( 7 ) G S 3 for real 7 > 2. The case a = 0 
would follow by taking the limit a —?■ 0. We define 


M{a,P) : = 


a (l-a)(l+^27^ Q, 

0 a —ajS 

0 0 1 


a e M,/3 > 0 . 


Choose /3 > 0 such that 7 = (1 + (3“^)/(3. Then the matrix ^( 7 ) is the 
limit a —)■ 0 of M{a,(3). It suffices to prove M{a,(3) G T 3 for > 0 
and for a in a neighborhood of zero. The discussion in the paragraph 
of fl5.4p shows that for a > 0 the numerical range of M{a,(3) is a disk 
centered at a. 

Let us prove that M{a,(3) is unitarily irreducible for all /9 > 0 and 
a ^ {0,1}. For a different from one, the latter is a simple eigenvalue 
(for M, as well as for M*), and 63 = [0,0,1], is the respective eigen¬ 
vector for M*. By contradiction, if M were unitarily reducible, then 
63 would lie in its reducing subspace, say L. Applying M to 63 and 
assuming a ^ 0 the vector [1,-/3, 0] also lies in L, and by applying 
M again the vector ei = [1,0,0] is in L. So, unless (3 is zero, L is 
3-dimensional, proving that M is unitarily irreducible. □ 


Lemma 5.3. //O < A < ^ then the matrix diag[0. A, 1] lies in S 3 . 

Proof: It suffices to show that for all A G [0, A] the diagonal ma¬ 
trix diag[0. A, 1] is a limit of matrices obtainable from A{a) := 000 , 

a G [0,1], considered in Lemma 15.21 via transformations flS.ip . To 
this end, introduce the matrix resulting from the affine transforma¬ 
tion {x,y) 1-7 ((1 — x)/2,ey), e > 0, applied to U*A{1 — 2X)U, where 

is unitary. A straightforward computation shows 
which converges for e ^ 0 to diag[0. A, 1]. □ 


U := 


■ 1 0 1 
10-1 
F2 Lo 0 
1 r 0 0 —e 
that it equals 


0 A 0 
e 0 1 j 


Lemmas I4.4f li. 15.1115.21 and 15.31 show the following. 

Lemma 5.4. A reducible 3-by-3 matrix lies in S 3 if and only if its 
numerical range is an ellipse, a segment, or a point. 

We shall compute the intersection of J ^3 with the set of reducible 
matrices. 

Lemma 5.5. A reducible 3-by-3 matrix A lies m J ^3 if and only ifW{A) 
is not an ellipse having the normal eigenvalue of A in the interior. 
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Pre-images of extreme points 


Proof: Lemma 14.41^ 2) excludes reducible matrices A from when 
W{A) is an ellipse with the normal eigenvalue of A in the interior. Let 
us show that all other reducible matrices he in 

We can assume that A is a matrix listed in Lemma 15.11 Let A 
be of the form fl5.2p . Clearly 0 G Otherwise, if A 7 ^ 0, then 
A = diag[0, A, 1] for A G [0, |] or A = diag[0, l,i]. In both cases 
we dehne for e > 0 a matrix M{e)j^k '■= Aj^^ + ie, j,k = 1,2,3. In 
the first case ‘A{M{e)) and in the second case 3?(M(e)) has a multiple 
eigenvalue. If e > 0 then these matrices are irreducible because their 
real and imaginary parts have no common eigenvectors. This proves 
AIG J 3 . 

Let A be of the form (15.3p with a > 1. It suffices to consider a > 1, 
the case a = 1 would follow by taking the limit. We apply transfor¬ 
mations (15.ip . namely the affine transformation —)■ [x,y] i-G- 

[x -|- y\/a? — 1, y] followed by a unitary transformation A 1 -^ u*Au with 


unitary 


u := 


a\/2 


0 1—i \/a^ — l — l 

0 a —a 

ay/2 0 0 


By means of these transformations we may assume that 


3?(y4) = diag[a, a, —a] and 


0 0 0 

i 0 Va^ —1 i 

0 — i —Va^ —1 


Now it is obvious that for all e > 0 the matrix 


diag[a, a, -a] -h ^ 


0 0_ e 

0 y/a '^ — l i 

e — i —V a'^ — 1 


is unitarily irreducible and Rem. I4.3f 3) shows that it lies in Since 
the matrix converges to A for e —?■ 0, the proof is complete. □ 


Lemmas 14.4115.41 and 15.51 combined yield the following. 

Theorem 5.6. The set 83 n is the subset of all reducible matrices 
A G 7^3 where W{A) is a point, a segment, or an ellipse with the 
normal eigenvalue of A on the boundary. 
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